# Multi-precision quantization
Unireason Qwen3 14B RL GGUF
Apache-2.0
A statically quantized version of UniReason-Qwen3-14B-RL, suited to text generation and mathematical reasoning research.
Large Language Model
Transformers English

mradermacher
272
1
Echelon AI Med Qwen2 7B GGUF
This project provides GGUF quantized files for the Echelon-AI/Med-Qwen2-7B model, with support from Featherless AI. The quantization aims to improve inference efficiency and reduce operating costs.
Large Language Model
featherless-ai-quants
183
1
Fpham Sydney Overthinker 13b HF GGUF
This project provides optimized GGUF quantized files intended to improve inference efficiency. The files are supported by Featherless AI, which lets users run a model of their choice for a small fee.
Large Language Model
featherless-ai-quants
133
1
Deepseek Ai DeepSeek R1 0528 GGUF
MIT
A quantized version of the DeepSeek-R1-0528 large language model, intended to run efficiently on different hardware.
Large Language Model
bartowski
2,703
6
A M Team AM Thinking V1 GGUF
Apache-2.0
A llama.cpp imatrix-quantized version of the a-m-team/AM-Thinking-v1 model, available in multiple quantization types and suited to text generation tasks.
Large Language Model
bartowski
671
1
Llama 2 7b Chat Hf GGUF
Llama 2 7B Chat is a 7B-parameter large language model developed by Meta, offered here in multiple quantized versions to accommodate different hardware requirements.
Large Language Model English
Mungert
1,348
3
Lacia Sum Small V1 GGUF
Lacia_sum_small_v1 is a statically quantized model based on the T5 architecture, primarily designed for text summarization tasks. It supports Russian and English, making it suitable for natural language processing applications.
Text Generation Supports Multiple Languages
mradermacher
312
1
GIGABATEMAN 7B GGUF
GIGABATEMAN-7B is a 7B-parameter large language model based on the Mistral architecture, focusing on text generation tasks.
Large Language Model English
mradermacher
115
3
Xwin LM 13B V0.1 GPTQ
Xwin-LM 13B V0.1 is a large language model based on the Llama2 architecture, developed by the Xwin-LM team. The model responds to user queries in a professional, detailed, and courteous manner, making it suitable for conversational scenarios.
Large Language Model
Transformers

TheBloke
868
18